Policy Improvement for several Environments
Authors
Abstract
In this paper we state a generalized form of the policy improvement algorithm for reinforcement learning. This new algorithm can be used to find stochastic policies that optimize single-agent behavior for several environments and reinforcement functions simultaneously. We first introduce a geometric interpretation of policy improvement, define a framework to apply one policy to several environments, and propose the notion of balanced policies. Finally, we explain the algorithm and present examples.
Similar resources
Policy Improvement for several Environments Extended Version
In this paper we state a generalized form of the policy improvement algorithm for reinforcement learning. This new algorithm can be used to find stochastic policies that optimize single-agent behavior for several environments and reinforcement functions simultaneously. We first introduce a geometric interpretation of policy improvement, define a framework to apply one policy to several envir...
Approximate Policy Iteration for several Environments and Reinforcement Functions
We state an approximate policy iteration algorithm to find stochastic policies that optimize single-agent behavior for several environments and reinforcement functions simultaneously. After introducing a geometric interpretation of policy improvement for stochastic policies we discuss approximate policy iteration and evaluation. We present examples for two blockworld environments and reinforcem...
Intra Sector Policy Interventions for Improvement of Iranian Health Financing System
Background and purpose: To determine an appropriate financial model for the health system of Iran, several studies have been conducted. But it seems that these studies were not comprehensive and further investigation is required. So to design a valid and enforceable mechanism, the study of policy interventions will be considered through consensus of all stakeholders. This investigation was done...
How Neoliberalism Is Shaping the Supply of Unhealthy Commodities and What This Means for NCD Prevention
Alcohol, tobacco, and unhealthy foods contribute greatly to the global burden of non-communicable disease (NCD). Member states of the World Health Organization (WHO) have recognized the critical need to address these three key risk factors through global action plans and policy recommendations. The 2013-2020 WHO action plan identifies the need to engage economic, agricultural and other relevant...
Multisectoral Actions for Health: Challenges and Opportunities in Complex Policy Environments
Multisectoral actions for health, defined as actions undertaken by non-health sectors to protect the health of the population, are essential in the context of inter-linkages between three dimensions of sustainable development: economic, social, and environmental. These multisectoral actions can address the social and economic factors that influence the health of a population at the local, natio...
Publication date: 2001